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Coronaviruses express two very large replicase polyproteins, the 16 autoproteolytic cleavage products of 
which collectively form the membrane-anchored replication complexes. How these structures are assembled is 
still largely unknown, but it is likely that the membrane-spanning members of these nonstructural proteins 
(nsps) are responsible for the induction of the double-membrane vesicles and for anchoring the replication 
complexes to these membranes. For 3 of the 16 coronavirus nsps—nsp3, nsp4, and nsp6—multiple transmem- 
brane domains are predicted. Previously we showed that, consistent with predictions, nsp4 occurs in mem- 
branes with both of its termini exposed in the cytoplasm (M. Oostra et al., J. Virol. 81:12323-12336, 2007). 
Strikingly, however, for both nsp3 and nsp6, predictions based on a multiple alignment of 27 coronavirus 
genome sequences indicate an uneven number of transmembrane domains. As a consequence, the proteinase 
domains present in nsp3 and nsp5 would be separated from their target sequences by the lipid bilayer. To look 
into this incongruity, we studied the membrane disposition of nsp3 and nsp6 of the severe acute respiratory 
syndrome coronavirus and murine hepatitis virus by analyzing tagged forms of the proteins expressed in 
cultured cells. Contrary to the predictions, in both viruses, both proteins had their amino terminus, as well as 
their carboxy terminus, exposed in the cytoplasm. We established that two of the three hydrophobic domains 
in nsp3 and six of the seven in nsp6 are membrane spanning. Subsequently, we verified that in nsp4, all four 
hydrophobic domains span the lipid bilayer. The occurrence of conserved non-membrane-spanning hydropho- 
bic domains in nsp3 and nsp6 suggests an important function for these domains in coronavirus replication. 


Positive-strand RNA viruses induce the formation of cyto- 
plasmic membrane structures in their host cells to accomplish 
the efficient replication of their genomes. These structures 
probably facilitate the orchestration of the replication process 
and the recruitment of the components required for RNA 
synthesis and may shield the RNA intermediates from recog- 
nition by the host cell’s defense mechanisms. The membranes 
of these structures can be acquired from different cellular 
compartments. In many virus families, such as Picorna-, Flavi-, 
and Bromoviridae, the RNA replication complex is associated 
with membranes derived from the endoplasmic reticulum 
(ER). However, endosomes and lysosomes (Togaviridae), per- 
oxisomes and chloroplasts (Tombusviridae), and mitochondria 
(Nodaviridae) are also used as membrane donors (for a review, 
see reference 44). In general, viral nonstructural proteins 
(nsps) are responsible for the assembly of the replication com- 
plex at these specific cellular organelles and for the observed 
membrane rearrangements. 

Coronaviruses are enveloped, plus-strand RNA viruses belong- 
ing to the family Coronaviridae that, together with the Arteri- and 
Roniviridae, belong to the order Nidovirales. With sizes ranging 
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between 27 and 32 kb, coronaviruses possess the largest genomes 
among all known RNA viruses. The 5’ two-thirds of the corona- 
viral genome is occupied by open reading frames (ORFs) that 
encode the viral replicase. The remaining part of the genome 
codes for the structural proteins, which invariably comprise at 
least the spike (S), envelope (E), membrane (M), and nucleocap- 
sid (N) proteins, and for a variable number of accessory proteins. 
Except for the replicase ORFs, all genes are translated from 
subgenomic mRNAs, which are generated by a process of discon- 
tinuous transcription (for a recent review, see reference 45). The 
viral replicase is encoded by the two most 5’ ORFs on the 
genomic RNA, orfla and orflb, which are translated into two 
very large precursor polyproteins, ppla and pplab, comprising 
approximately 4,000 and 7,000 amino acids, respectively. pp1ab is 
translated only after a —1 frameshift induced by a slippery se- 
quence at the end of orfla, which only occurs in a fraction of 
translational events (5). 

The replicase polyproteins are processed by virus-encoded pro- 
teinases to produce 16 mature nsps (19, 63). nsp1, -2, and -3 are 
released through cleavage by a papain-like proteinase, one or two 
functional copies of which are contained within nsp3; all other 
cleavages are performed by the 3C-like main proteinase located 
in nsp5 (2, 11, 17, 55, 64). ppla, which contains the first 11 nsps, 
including the proteinases, is three to five times more abundantly 
produced than pplab, which additionally gives rise to nsp12 to 
nsp16 (13). Functions involving the actual replication and tran- 
scription of the viral genome have been assigned to several of the 
orflb-encoded nsps, like RNA-dependent RNA polymerase 
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FIG. 1. ppla transmembrane domain prediction based on multiple alignment. The presence of transmembrane domains in the ppla precursor 
was predicted (51) on the basis of a multiple alignment of 27 different coronavirus sequences with representatives from each of the different groups 
(RefSeq in the NCBI CoreNucleotide database). The resulting hydrophobicity plot is shown in the upper panel, with peaks reaching the threshold 
(dotted line) representing predicted membrane-spanning domains. The black line in the middle panel represents the protein and shows, based on 
the known topology of nsp4 (37), its predicted localization on the lumenal or cytoplasmic side of the membrane, which is symbolized by the gray 
bar. At the bottom, a schematic representation of ppla is shown in which the regions containing the putative transmembrane (TM) domains, the 
papain-like protease (PLpro, two in MHV and one in SARS-CoV), and the main protease (Mpro) are highlighted. The protease cleavage sites are 
indicated by arrowheads, with the nsp3-encoded PLpro cleavage sites in gray and the nsp5-encoded Mpro cleavage sites in black. 


(RdRp) activity in nsp12, helicase activity in nsp13, exonuclease 
activity in nsp14, endoribonuclease activity in nsp15, and methyl- 
transferase activity in nsp16 (3, 7, 9, 25-27, 43, 63). Except for the 
proteases, the functions of the orfla-encoded nsps are less clearly 
defined. ADP-ribose-1"-monophosphatase activity has been iden- 
tified in nsp3, whereas in nsp8 a second RdRp activity was dis- 
covered (24, 42). nsp7 and nsp8 together were shown to form a 
hexadecameric complex able to bind nucleic acids and suggested 
to function as a processivity factor for the RdRp (62). nsp9 and 
nsp10 were also found to bind nucleic acids, while nsp10, which 
contains two zinc finger domains, might additionally be somehow 
involved in the processing of the polyproteins (12, 31, 52). 

The nsps collectively assemble into membrane-associated 
complexes that constitute the sites of de novo viral RNA syn- 
thesis (47, 58). The virus-encoded N protein and possibly sev- 
eral cellular proteins are also recruited to these sites (4, 46). 
The replication complexes are found in the perinuclear region 
of the host cell anchored to double-membrane vesicles 
(DMVs) (6, 20, 48). The origin of the membranes in these 
structures has not unambiguously been established. Several 
cellular pathways and organelles, such as the ER, Golgi com- 
plex, endosomal/lysosomal system, and autophagic pathway, 
have been implicated in the formation of the replication com- 
plexes (40, 47, 48, 57). However, recent studies indicate the ER 
to be the most likely lipid donor compartment (37, 48). 

Essentially nothing is known about how the membrane-an- 
chored replication complexes are induced and assembled and 
how the individual nsps and other necessary constituents are 
recruited to these sites. Coronavirus orfla encodes three 
nsps—nsp3, nsp4, and nsp6—that are predicted to contain 
transmembrane domains. It is likely that these proteins not 
only function in the membrane anchoring of the multisubunit 
replication complex but also induce the formation of the mem- 
brane structures. The membrane association of mouse hepati- 
tis virus (MHV) and severe acute respiratory syndrome-asso- 
ciated coronavirus (SARS-CoV) nsp3 has been demonstrated 
previously (22, 28). In addition, the membrane association, as 
well as the topology, of nsp4 of these two viruses has been 


resolved (37). The involvement of these nsps in the formation 
of replication complexes is supported by the fact that for the 
related arterivirus equine arteritis virus (EAV), coexpression 
of the counterparts of nsp3 and nsp4, ie., nsp2 and nsp3, is 
sufficient for the induction of DMVs (49). Furthermore, mu- 
tations in MHV nsp4 or EAV nsp3 impaired DMV formation 
(8, 39). 

In this study, we wanted to fill a gap in the existing knowl- 
edge of coronaviruses by establishing an experimentally veri- 
fied topology model of the complete coronavirus replicase 
polyprotein. When transmembrane predictions (51) were per- 
formed on the entire orfla region based on the multiple align- 
ment of 27 coronavirus sequences, with representatives from 
each of the different coronavirus groups (present as reference 
sequences [RefSeq] in the NCBI CoreNucleotide database), 
some discrepancies were observed (Fig. 1). For nsp4, four 
transmembrane domains were predicted, which is in agree- 
ment with our previous experimental data that showed that 
both termini are located on the cytoplasmic face of the mem- 
brane (37). However, for both nsp3 and nsp6, an uneven num- 
ber of transmembrane domains was predicted, three and seven, 
respectively, resulting in a model in which the proteinases, 
present in nsp3 and nsp5, would be separated from some of 
their target sequences by the lipid bilayer. Furthermore, this 
prediction places nsp1, nsp2, and most of nsp3 on the lumenal 
side of the membrane, which is unlikely as ppla lacks an 
amino-terminal signal sequence. In addition, several studies 
have shown that nsp1 and nsp2, as well as a number of nsps 
downstream of nsp6, localize to the cytosol (6, 21, 22, 57). To 
solve these discrepancies, we focused on the topology and 
membrane integration of the hydrophobic nsps, particularly 
nsp3 and nsp6. To strengthen our conclusions, the nsps of both 
MHV and SARS-CoV were studied. Contrary to the predic- 
tions, in each virus both nsp3 and nsp6 appeared to have the 
amino terminus as well as the carboxy terminus exposed on the 
cytoplasmic side of the membrane. We show that in both nsp3 
and nsp6, not all of the predicted transmembrane domains are 
used as such. In nsp3, only two, and in nsp6, only six of the 
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predicted transmembrane domains actually span the lipid bi- 
layer. These results raise the question of why coronaviruses 
have conserved hydrophobic domains in their nsps that do not 
function as membrane-spanning domains. 


MATERIALS AND METHODS 


Cells, viruses, and antibodies. OST7-1 cells, obtained from B. Moss (16), were 
maintained as monolayer cultures in Dulbecco’s modified Eagle’s medium 
(DMEM) (Cambrex Bio Science Verviers) containing 10% fetal calf serum 
(FCS) (Bodinco B.V.), 100 IU of penicillin per ml, and 100 wg of streptomycin 
per ml (referred to as culture medium). Recombinant vaccinia virus encoding the 
bacteriophage T7 RNA polymerase (vTF7-3) was also obtained from B. Moss 
(18). 

Rabbit polyclonal antisera directed against the enhanced green fluorescent 
protein (EGFP) or the influenza virus hemagglutinin (HA) tag were obtained 
from ICL. Rabbit antiserum recognizing the C-terminal domain of the MHV 
membrane (M) protein (anti-Mc) has been described previously (30), while 
mouse monoclonal antibody against the amino terminus of MHV M (J1.3 or 
anti-My) was provided by J. Fleming (53). 

Plasmid constructions. First, a plasmid was created in which all gene frag- 
ments could be cloned behind a T7 promoter in frame with the sequence en- 
coding the EGFP tag. To this end, the pEGFP-N3 vector (Clontech) was di- 
gested with EcoRI and NotI, of which the latter restriction site was filled in with 
Klenow polymerase (Invitrogen) and this fragment was cloned into the EcoRI- 
and BamHI-digested pTUG31 vector (59), of which the BamHI restriction site 
was also filled in with Klenow polymerase (Invitrogen), thereby creating pTug- 
EGFP. An N glycosylation site was created in the EGFP gene by performing 
site-directed mutagenesis with the QuikChange site-directed mutagenesis kit 
(Stratagene) according to the manufacturer’s instructions on pEGFP-N3 with 
primers 3212 and 3213. This mutated EGFP gene was cloned into the pTUG31 
vector similar to the wild-type EGFP gene, thereby creating pTug-EGFP®"*. 
(For the sequences and locations on the viral genomes of all of the primers used, 
see Table S1 in the supplemental material.) 

The SARS-CoV nsp gene fragments were obtained by reverse transcriptase- 
PCR amplification of viral RNA isolated from SARS-CoV isolate 5688 (29) with 
primers 3072 and 3073 for nsp3 (nsp3,) and primers 3070 and 3071 for nsp6 
(nsp6,). The MHV nsp gene fragments were obtained by reverse transcription- 
PCR amplification of viral genomic RNA isolated from MHV strain A59 with 
primers 3632 and 2933 for nsp3 (nsp3,,) and primers 2974 and 2975 for nsp6 
(nsp6,,). The PCR products were cloned into the pGEM-T Easy vector (Pro- 
mega), and their sequences were confirmed by sequence analysis. Site-directed 
mutagenesis to mutate the N glycosylation sites was performed on the pGem-T 
Easy constructs containing the nsp3 gene fragments with primers 3354 and 3355 
for SARS-CoV nsp3 and primers 3630 and 3631 for MHV nsp3. 

The nsp gene fragments were cloned into the pTug-EGFP or pTug-EGFP®° 
vector by digesting the pGem-T Easy constructs with EcoRI and BamHI and 
cloning the fragments obtained into the EcoRI-BamHI-digested pTug-EGFP 
and pTug-EGFP2° vectors. The plasmids created encode the different nsps 
fused C terminally to the wild-type or mutant EGFP tag. The same EcoRI- 
BamHI nsp3 and nsp6 fragments were also cloned into the EcoRI-BamHI- 
digested pTUG31 vector together with a primer dimer of primers 3050 and 3051, 
resulting in plasmids encoding the nsps C terminally fused to a HA tag. In these 
latter constructs, as well as in the pTug construct encoding MHV nsp6 fused to 
EGFP®"* (pTug-nsp6,,-EGFP2""°), an MHV M (My) tag-encoding sequence was 
inserted in front of the nsps by cloning a primer dimer of primers 3019 and 3020, 
coding for the 10-residue amino-terminal sequence of the MHV M protein 
(MSSTTQAPEP), into the XhoI-EcoRV-restricted plasmids, thereby creating 
constructs that encode nsps tagged at both termini. 

MHV nsp6 lacking the first hydrophobic domain (nsp6,,AHD1’) was amplified 
by PCR with primers 3566 and 2975. The PCR product was cloned into the 
pGEM-T Easy vector (Promega), and the sequence was confirmed by sequence 
analysis. A fragment was obtained by digestion with EcoRV and BamHI and 
cloned into the EcoRV-BamHI-digested pTugMy-nsp6,,-EGFP®"° plasmid, 
thereby creating a construct containing MHV nsp6 without the first hydrophobic 
domain fused N terminally to the MHV M tag and C terminally to the EGFP tag 
containing the N glycosylation site. 

Progressive C-terminal deletion mutant forms lacking one or more hydropho- 
bic domains were made for both nsp3 and -6. PCRs were performed with the 
same forward primers as described before; for the reverse primers used, see 
Table S1 in the supplemental material. For nsp3, the PCRs were performed on 
the constructs with the mutated N glycosylation sites. The PCR products were 
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cloned into the pGEM-T Easy vector (Promega), and the sequences were con- 
firmed by sequence analysis. Fragments were obtained by digestion with EcoRI 
and BamHI and cloned into the EcoRI-BamHI-digested pTug-EGFP®”* vector. 

By using combinations of the primers used to create the C-terminal deletions 
and primers for N-terminal deletions, the sequences encoding each of the hy- 
drophobic domains of MHV nsp6 were also amplified separately. These se- 
quences were cloned into the pGEM-T Easy vector (Promega) and confirmed by 
sequence analysis. Fragments were obtained by digestion with EcoRV and 
BamHI and cloned into the EcoRV-BamHI-digested pTugMy-nsp6,,-EGFP®° 
plasmid, thereby creating constructs encoding the MHV nsp6 fragments fused N 
terminally to the MHV M tag and C terminally to the EGFP tag containing the 
N glycosylation site. 

The construct containing SARS-CoV nsp4 in which the N glycosylation site 
had been removed by mutation has been described previously (37). This con- 
struct was used to create C-terminal deletion mutants lacking one to three of the 
hydrophobic domains. PCRs were performed with primer 3648 and primer 3848, 
3650, or 3651; the products were cloned into the pGEM-T Easy vector (Pro- 
mega); and the sequences were confirmed by sequence analysis. Fragments were 
obtained by digestion with EcoRI and BamHI and cloned into the EcoRI- 
BamHI-digested pTug-EGFP®”° vector. 

The construct containing MHV nsp4 has also been described previously (37). 
The glycosylation sites in MHV nsp4 were mutated by performing sequential 
site-directed mutagenesis reactions with primers 3758 and 3759 and primers 3760 
and 3761. The construct with the mutated glycosylation sites was used to create 
C-terminal deletion mutants by performing PCRs with primer 2890 and primer 
3847, 3756, or 3757. The PCR products were cloned into the pGEM-T Easy 
vector (Promega), and the sequences were confirmed by sequence analysis. 
Fragments were obtained by digestion with EcoRI and BamHI and cloned into 
the EcoRI-BamHI-digested pTug-EGFP®"° vector. 

The constructs encoding the equine arterivirus (EAV) membrane protein N 
terminally extended with the My tag (EAV M + 9A) (10) and the 8a protein C 
terminally tagged with the EGFP tag with or without the N glycosylation site (36) 
have been described previously. 

Infection and transfection. Subconfluent monolayers of OST7-1 cells grown in 
10-cm? tissue culture dishes were inoculated with vIF7-3 at a multiplicity of 
infection of 10 for 1 h, after which the medium was replaced with a transfection 
mixture consisting of 0.5 ml of DMEM without FCS but containing 10 pl 
Lipofectin (Invitrogen) and 5 wg of each selected construct. After a 5-min 
incubation at room temperature, 0.5 ml of DMEM was added and incubation was 
continued at 37°C. At 3 h postinfection (p.i.), the medium was replaced with 
culture medium. Where indicated, tunicamycin (5 yg/ml) or brefeldin A (6 
pg/ml) was added to the culture medium at 3 h p.i. 

Metabolic labeling and immunoprecipitation. Prior to labeling, the cells were 
starved for 30 min in cysteine- and methionine-free modified Eagle’s medium 
containing 10 mM HEPES (pH 7.2) and 5% dialyzed FCS. This medium was 
replaced with 1 ml of a similar medium containing 100 Ci of *S in vitro 
cell-labeling mixture (Amersham), after which the cells were further incubated 
for the indicated time periods. After labeling, the cells were washed once with 
phosphate-buffered saline (PBS) containing 50 mM Ca?* and 50 mM Mg?* and 
then lysed on ice in 1 ml of lysis buffer (0.5 mM Tris [pH 7.3], 1 mM EDTA, 0.1 
M NaCl, 1% Triton X-100) per 10-cm* dish. The lysates were cleared by cen- 
trifugation for 5 min at 15,000 rpm and 4°C. 

Coupled in vitro transcription and translation reactions were performed 
with the TNT coupled reticulocyte lysate system from Promega, according to 
manufacturer’s instructions, in the presence of *S in vitro labeling mixture 
(Amersham) but without the use of microsomal membranes. 

Radioimmunoprecipitations were performed essentially as described previ- 
ously (35); 200-1 aliquots of the cell lysates or 5-1 volumes of in vitro trans- 
lation reaction mixtures were diluted in 1 ml detergent buffer (50 mM Tris [pH 
8.0], 62.5 mM EDTA, 1% NP-40, 0.4% sodium deoxycholate, 0.1% sodium 
dodecyl sulfate [SDS]) containing antibodies (2 yl rabbit anti-EGFP or rabbit 
anti-HA serum or 25 pl of J1.3 monoclonal anti-MHV M serum). The immu- 
noprecipitation mixtures were incubated overnight at 4°C. The immune com- 
plexes were adsorbed to Pansorbin cells (Calbiochem) for 60 min at 4°C and 
subsequently collected by centrifugation. The pellets were washed three times by 
resuspension and centrifugation with RIPA buffer (10 mM Tris [pH 7.4], 150 mM 
NaCl, 0.1% SDS, 1% NP-40, 1% sodium deoxycholate). The final pellets were 
suspended in Laemmli sample buffer (LSB) and heated at 95°C for 1 min before 
analysis by SDS-polyacrylamide gel electrophoresis (PAGE) with 10 to 15% 
polyacrylamide gels. 

Where indicated, immunoprecipitates were treated with peptide—N-glycosi- 
dase F (PNGaseF; New England BioLabs). To this end, the final immunopre- 
cipitation pellets were suspended in PBS instead of LSB, 2 wl PNGaseF was 
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FIG. 2. Processing of SARS-CoV and MHV nsp3 and nsp6. vIF7-3-infected OST7-1 cells were transfected with the indicated constructs. The 
cells were labeled with [*°S]methionine from 5 to 6 h p.i., lysed, and processed for immunoprecipitation with antiserum directed to the EGFP tag, 
followed by SDS-PAGE. (A) Cells were transfected with SARS-CoV or MHV nsp3-EGFP (nsp3,-EGFP or nsp3,,-EGFP, respectively)-encoding 
constructs without or with mutation (—glyc) of the N glycosylation sites in the presence (TM) or absence (— and P) of tunicamycin. The constructs 
with intact glycosylation sites were also transcribed and translated in vitro with the TNT coupled reticulocyte lysate system from Promega (ivt.). 
After immunoprecipitations, the samples were mock (TM, —) or PNGaseF (P) treated. (B) Cells were transfected with SARS-CoV or MHV 
nsp6-EGFP (nsp6,-EGFP and nsp6,,-EGFP, respectively)-encoding constructs in the presence (+) or absence (—) of tunicamycin (TM). The same 
constructs were also transcribed and translated in vitro with the TNT coupled reticulocyte lysate system from Promega (ivt.). The positions and 
masses (in kilodaltons) of protein size markers are indicated at the left. Only the relevant portion of the gels is shown. 


added, and the samples were incubated at 37°C for 2 h. Before analysis by 
SDS-PAGE, 0.5 volume of a three-times-concentrated solution of LSB was 
added to the samples, which were then heated at 95°C for 1 min. 

Immunofluorescence microscopy. OST7-1 cells grown on glass coverslips were 
fixed at the indicated times postinfection with 3% paraformaldehyde for 1 h at room 
temperature. The fixed cells were washed with PBS and permeabilized with either 
0.1% Triton X-100 for 10 min at room temperature or 0.5 g/ml digitonin (diluted 
in 0.3 M sucrose-25 mM MgCl*-0.1 M KCl-1 mM EDTA-10 mM PIPES [pH 
6.8]) for 5 min at 4°C. Next, the permeabilized cells were washed with PBS and 
incubated for 15 min in blocking buffer (PBS-10% normal goat serum), followed by 
a 45-min incubation with antibodies directed against HA or against the C- or 
N-terminal domain of MHV M. After four washes with PBS, the cells were incu- 
bated for 45 min with either fluorescein isothiocyanate-conjugated goat anti-rabbit 
immunoglobulin G antibodies (ICN) or Cy5-conjugated donkey anti-mouse immu- 
noglobulin G antibodies (Jackson Laboratories). After four washes with PBS, the 
samples were mounted on glass slides in FluorSave (Calbiochem). The samples were 
examined with a confocal fluorescence microscope (Leica TCS SP2). 


RESULTS 


Processing of nsp3 and nsp6. We started our studies of nsp3 
and nsp6 by analyzing the co- and posttranslational modifica- 
tions of the two proteins from both SARS-CoV and MHV. To 
this end, gene fragments encoding nsp3 and nsp6 of both 
viruses were cloned into the pTUG31 vector behind a T7 
promoter and fused to an EGFP tag, since no antibodies to the 
proteins themselves were available. As the full-length nsp3- 
encoding genome segments are very large (approximately 6 kb) 
and difficult to clone, only the 3’-terminal 2-kb fragments, 
which encode all of the hydrophobic domains, were cloned. 
nsp3 of both SARS-CoV and MHV contains potential N gly- 
cosylation sites (NXS/T) in front of the first hydrophobic do- 
main, between the first and second hydrophobic domains, and 
behind the third hydrophobic domain (Fig. 1 shows the local- 
ization of the hydrophobic domains). The glycosylation sites 
between the first and second hydrophobic domains, two for 
SARS-CoV and one for MHV, have previously been shown to 


be functional (22, 28). No potential N glycosylation sites were 
identified in the sequence of MHV nsp6, while for SARS-CoV 
nsp6, an N glycosylation site was predicted between the fifth 
and sixth hydrophobic domains. 

We studied the expression and processing of the nsps by in 
vitro translation and by using the recombinant vaccinia virus 
bacteriophage T7 RNA polymerase (vTF7-3) expression sys- 
tem. OST7-1 cells were infected with vITF7-3, transfected with 
plasmids containing the nsp3-EGFP or nsp6-EGFP gene, and 
labeled with [*°S]methionine from 5 to 6 h p.i. The cells were 
lysed and processed for immunoprecipitation with a rabbit 
polyclonal antiserum directed to the EGFP tag. In parallel, in 
vitro translations were performed with the TNT coupled re- 
ticulocyte lysate system from Promega in the absence of mem- 
branes to analyze the electrophoretic mobility of the nonproc- 
essed proteins. To demonstrate the presence of the N-linked 
sugars on the nsps, the proteins were expressed in the presence 
and absence of tunicamycin, which is an inhibitor of N-linked 
glycosylation, and/or the N-linked glycans were enzymatically 
removed with PNGaseF. 

As shown in Fig. 2A, the electrophoretic mobility of nsp3 
expressed in OST7-1 cells in the presence of tunicamycin was 
similar to that of the in vitro-translated product, whereas the 
protein expressed in the absence of tunicamycin migrated 
slower. Treatment of this latter protein with PNGaseF shifted 
its electrophoretic mobility to that of the in vitro-translated 
product and of the protein expressed in the presence of tuni- 
camycin. This result confirms the addition of N-linked glycans 
to nsp3, as has been demonstrated previously (22, 28). Next, 
the N glycosylation sites between the first and second hydro- 
phobic domains were mutated and these proteins, in fusion 
with EGFP (nsp3~®"*-EGFP), were also expressed in the 
presence and absence of tunicamycin and/or treated with 
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FIG. 3. Glycosylation of tagged SARS-CoV and MHV nsp3 and nsp6. vIF7-3-infected OST7-1 cells were transfected with the indicated 
constructs. The cells were labeled with [?°S]methionine from 5 to 6 h p.i., lysed, and processed for immunoprecipitation, followed by SDS-PAGE. 
(A) Cells were transfected with constructs encoding SARS-CoV or MHV nsp3 or nsp6 or encoding the SARS-CoV orf8a protein (8a), each fused 
either to a wild-type EGFP tag (wt) or to an EGFP tag with an N glycosylation site (+glyc). The proteins were expressed in the presence (+) or 
absence (—) of tunicamycin (TM). Immunoprecipitations were performed with rabbit antiserum directed to the EGFP tag. (B) Cells were 
transfected with constructs encoding SARS-CoV or MHV nsp3 or nsp6 with a C-terminal HA tag and an N-terminal My tag (’) or with a construct 
encoding the EAV M protein with the same My tag (EAV M + 9A). MHV nsp6 lacking the first hydrophobic domain (nsp6,,AHD1') contains 
the N-terminal My tag in combination with a C-terminal EGFP tag. The cells were incubated in the presence (+) or absence (—) of brefeldin A 
(bref A) and or tunicamycin (TM). Immunoprecipitations were performed with rabbit antiserum directed to the HA tag or, for EAV M, with 
monoclonal antibody J1.3 (directed against the My tag). Only the relevant portions of the gels are shown. 


PNGaseF. The presence of tunicamycin or treatment with 
PNGaseF did not influence the electrophoretic mobility of 
these mutant proteins (Fig. 2A). This clearly demonstrates that 
the N glycosylation sites between the first and second hydro- 
phobic domains are the only N-glycan attachment sites. The 
other potential sites are likely inaccessible, probably because 
they are located on the cytoplasmic side of the membrane. 
vTF7-3-expressed nsp6 of both viruses comigrated in the gel 
with the corresponding in vitro-translated products, both in the 
presence and in the absence of tunicamycin (Fig. 2B). Also, 
some lower-molecular-weight products were observed after the 
in vitro translation of MHV nsp6, which probably resulted 
from translation initiation at more downstream start codons. 
Apparently, nsp6 of SARS-CoV or MHV is not N glycosylated, 
indicating that the putative glycosylation site in the region 
between the fifth and sixth hydrophobic domains of SARS- 
CoV nsp6 is either located on the cytoplasmic side of the 
membrane or not accessible for glycosylation for other reasons. 
Furthermore, it appeared that both nsp6 fusion proteins mi- 
grated with lower mobility in the gel than predicted on the 
basis of their amino acid sequences. Similar results have been 
obtained before for other highly hydrophobic proteins (37, 41). 
Membrane topology of nsp3 and nsp6. In order to elucidate 
the membrane topology of SARS-CoV and MHV nsp3 and 
nsp6, we studied the disposition of their amino and carboxy 
termini. Therefore, the proteins were C or N terminally ex- 
tended with tags containing potential glycosylation sites. The C 
termini of the proteins were fused to an EGFP tag in which an 
N glycosylation site had been created (EGFP®"*). As a control, 
we used a fusion protein generated earlier which essentially 
consists of a cleavable signal sequence fused to the same tag 
(8a-EGFP) (36). For nsp3, the tag was fused to either the 
SARS-CoV or the MHV gene fragment in which the N glyco- 
sylation sites between the first and second hydrophobic do- 


mains had been disabled by mutation (nsp3~®""°) to allow dis- 
crimination between glycosylation of the tag and that of nsp3 
itself. The fusion proteins were expressed with the vIF7-3 
expression system in the presence and absence of tunicamycin. 
N glycosylation of the C-terminal EGFP tag would demon- 
strate that the carboxy terminus of the protein is located on the 
lumenal side of the membrane. 

For each of the proteins (nsp3 or nsp6), a similar electro- 
phoretic mobility was observed regardless of the EGFP tag 
used (i.e., with or without the N glycosylation site) or the 
presence of tunicamycin (Fig. 3A). The control protein, 8a, 
behaved as expected. In the presence of tunicamycin, the pro- 
tein with the EGFP®"* tag migrated with the same mobility as 
the protein with the wild-type tag in the absence of tunicamy- 
cin, whereas in the absence of tunicamycin the protein with the 
EGFP®"* tag migrated slower (Fig. 3A). The results demon- 
strate that the carboxy termini of nsp3 and nsp6 of both SARS- 
CoV and MHV are located on the cytoplasmic side of the 
membrane. 

To examine the disposition of the nsp3 and nsp6 amino 
termini, the N-terminal 10-residue sequence of the MHV M 
protein (M,,), which contains a well-defined O glycosylation 
motif, was fused to the amino terminus of each nsp. The 
functionality of this tag was previously demonstrated after fu- 
sion to the EAV type III M protein, resulting in EAV M + 9A 
(10). This protein has a Nexo/Cendo topology and is retained 
in the ER. Yet it became O glycosylated upon the addition of 
brefeldin A, a drug which causes the redistribution of Golgi 
enzymes, including the ones involved in O glycosylation, to the 
ER. By a similar approach, the location of the N termini of 
nsp3 and nsp6 was assessed. The nsp fusion proteins, contain- 
ing the N-terminal My and a C-terminal HA tag, were ex- 
pressed with the vIF7-3 expression system in OST7-1 cells in 
the presence or absence of brefeldin A and/or tunicamycin. 
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Tunicamycin was added to the cells expressing the nsp3 fusion 
protein to prevent its N glycosylation, as this could obscure the 
detection of its O glycosylation. 

As shown in Fig. 3B, the EAV M + 9A control protein 
showed a shift in electrophoretic mobility when it was ex- 
pressed in the presence of brefeldin A. In contrast, the elec- 
trophoretic mobilities of the nsp3 and nsp6 fusion proteins 
were unaffected by the addition of brefeldin A. As expected, 
the presence of tunicamycin did prevent the addition of N- 
linked sugars to nsp3. These results indicate that the amino 
termini of SARS-CoV and MHV nsp3 and nsp6 are not ac- 
cessible to enzymes that mediate the addition of O-linked 
sugars, which is most likely explained by the cytoplasmic ex- 
posure of these termini, although misfolding of the amino- 
terminal tag as the cause can also not be ruled out completely. 
We do not consider the latter explanation very likely, since the 
presence of two proline residues in the tag has previously been 
demonstrated to induce a conformation favorable for glycosy- 
lation (10). Indeed, when the first hydrophobic domain of 
MHV nsp6 was removed, the resulting protein carrying the 
amino-terminal tag (now in combination with a C-terminal 
EGFP tag) did become modified by O-linked sugars in the 
presence of brefeldin A, as shown by the appearance of an 
extra band which runs at a slightly higher position in the gel. 
The difference in electrophoretic mobility between the glyco- 
sylated and unglycosylated protein species is smaller for the 
nsp6 mutant than for the EAV M protein because of the much 
higher molecular weight of EGFP-tagged nsp6 (Fig. 3B). 

The localization of the amino and carboxy termini of each 
nsp was also determined by immunofluorescence assays. In 
these experiments, nsps were used that were tagged at both 
ends, containing the N-terminal M,, and a C-terminal HA 
extension. OST7-1 cells were infected with vIF7-3 and trans- 
fected with plasmids encoding the fusion proteins. The cells 
were fixed at 6 h p.i. with 3% paraformaldehyde and perme- 
abilized under strictly controlled conditions with either Triton 
X-100, which permeabilizes all cellular membranes, or digito- 
nin, which selectively permeabilizes the plasma membrane. 
The type IIT MHV M protein, which has a known Nexo/Cendo 
topology and localizes to the Golgi compartment, was used as 
a control. A rabbit polyclonal antibody directed to the C ter- 
minus and a mouse monoclonal antibody directed to the N 
terminus were used to detect this protein after the use of each 
of the permeabilization methods. As expected, the antibody 
directed to the C terminus detected the protein after Triton 
X-100 permeabilization, as well as after digitonin permeabili- 
zation, whereas the antibody directed to the N terminus only 
detected the protein after permeabilization with Triton X-100 
and not after treatment with digitonin, thereby validating the 
assay conditions (Fig. 4). 

Similar experiments were performed for SARS-CoV and 
MHV nsp3 and nsp6, with the exception that a different rabbit 
antiserum, directed against the C-terminal HA tag, was used. 
Both nsp3 and nsp6 appeared to localize in a reticular pattern 
reminiscent of the ER, as observed at higher magnification 
(data not shown). After permeabilization with Triton X-100 or 
digitonin, cells were stained with the rabbit antibody directed 
to the C terminus and the mouse antibody directed to the N 
terminus. The results, shown in Fig. 4, demonstrate that the 
amino and carboxy termini of both nsp3 and nsp6 are located 
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FIG. 4. Membrane topology of SARS-CoV and MHV nsp3 and nsp6. 
vTF7-3-infected OST7-1 cells were transfected with constructs encoding 
the proteins indicated at the left. The cells were fixed at 6 h p.i. and 
permeabilized with Triton X-100 (left two columns) or digitonin (right 
two columns). Immunofluorescence analysis was performed with antibod- 
ies against the C-terminal (C-term.) tag, anti-M. for M,, and anti-HA for 
the nsps (first and third columns), or against the N-terminal (N-term.) tag, 
J1.3 (anti-My, second and fourth columns). 


on the cytoplasmic face of the membrane and confirm the 
results obtained with the biochemical experiments shown in 
Fig. 3. Furthermore, the localization of the nsp3 amino termi- 
nus on the cytoplasmic face of the membrane is in agreement 
with the observed N glycosylation in the region between the 
first and second transmembrane domains. 

Membrane integration of nsp3. The above results show that 
both nsp3 and nsp6 have a Nendo/Cendo membrane topology, 
indicating that both nsps have an even number of transmem- 
brane domains. This is not in agreement with the three and 
seven transmembrane domains that are predicted for nsp3 and 
nsp6, respectively. Therefore, we examined which of the pre- 
dicted transmembrane domains are indeed used as such. To 
this end, constructs were made that encode progressive C- 
terminal deletion mutant forms of the nsps, lacking one or 
more hydrophobic domains, fused to the EGFP®"* tag. 

As SARS-CoV nsp3 appeared to be N glycosylated between 
the first two of its three predicted transmembrane domains, it 
seems most likely that either the second or the third hydro- 
phobic domain does not function as a membrane-spanning 
domain. Mutant forms were made in which the C-terminal 
hydrophilic tail was deleted or in which the C-terminal deletion 
was extended to include either the third hydrophobic domain 
or both the third and second hydrophobic domains (Fig. 5A). 
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FIG. 5. Membrane integration of deletion mutant forms of SARS-CoV and MHV nsp3. (A, D) Schematic representations of the C-terminal 
(C-term.) deletion mutant forms of SARS-CoV (A) and MHV (D) nsp3, with the hydrophobic domains presented as black rectangles and the 
EGFP®" tag in gray. The corresponding hydrophobic domains in the two proteins are indicated by numbers. (B, E) vIF7-3-infected OST7-1 cells 
were transfected with the indicated constructs and expressed in the presence (+) or absence (—) of tunicamycin (TM). The cells were labeled with 
[S]methionine from 5 to 6 h p.i., lysed, and processed for immunoprecipitation with anti-EGFP antiserum followed by SDS-PAGE. The positions 
and masses (in kilodaltons) of the protein size markers are indicated at the left. The asterisks indicate the position of the glycosylated protein 
species. Below the panels, the observed presence or absence of N glycosylation is indicated by a plus or a minus sign, respectively. (C, F) Models 
of the membrane structures of SARS-CoV (C) and MHV (F) nsp3, with the hydrophobic domains presented as black rectangles. For comparison, 


the MHV nsp3 model proposed by Baker and coworkers is shown below in gray. 


These deletion mutant forms, which were fused to the 
EGFP®"* tag, additionally carried the mutations that disrupt 
the N glycosylation motifs between the first and second hydro- 
phobic domains. 

vTF7-3-infected OST7-1 cells were transfected with constructs 
encoding the different fusion proteins and labeled from 5 to 6 h 
p.i. in the presence or absence of tunicamycin. The cells were 
lysed and processed for immunoprecipitation with the anti-EGFP 
serum. As shown in Fig. 5B, complete removal of the hydrophilic 
C terminus of nsp3 (AC-tail) does not affect the glycosylation of 
the fusion protein, as expected. When the third hydrophobic do- 
main was additionally deleted, the fusion protein still remained 
indifferent to tunicamycin and was thus not glycosylated, indicat- 


ing that this third hydrophobic domain did not function as a 
transmembrane domain (Fig. 5B). However, when the second 
hydrophobic domain was deleted as well, an extra protein species 
with a slightly lower electrophoretic mobility appeared after ex- 
pression in the absence of tunicamycin compared to in its pres- 
ence (Fig. 5B). Although the protein was only partially glycosy- 
lated, this result demonstrates that the carboxy-terminal EGFP 
tag fused to this mutant form of nsp3 is translocated to the 
lumenal side of the membrane. As nsp3AHD2-3 was the only 
mutant form that became modified by N-linked sugars, we con- 
clude that SARS-CoV nsp3 spans the lipid bilayer only twice and 
that the third hydrophobic domain does not function as a trans- 
membrane domain (Fig. 5C). 
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FIG. 6. Membrane integration of deletion mutant forms of MHV nsp6. (A) Schematic representation of the MHV nsp6 C-terminal deletion 
mutant forms, with the hydrophobic domains presented in black and the EGFP®"* tag in gray. (B) vIF7-3-infected OST7-1 cells were transfected 
with the constructs presented in panel A and expressed in the presence (+) or absence (—) of tunicamycin (TM). The cells were labeled with 
[?°S]methionine from 5 to 6h p.i., lysed, and processed for immunoprecipitation with anti-EGFP antiserum, followed by SDS-PAGE. The positions 
and masses (in kilodaltons) of the protein size markers are indicated at the left. The asterisks indicate the positions of the glycosylated protein 
species. Below the gel, the observed presence or absence of N glycosylation is indicated by a plus or a minus sign, respectively. (C) Models of the 
membrane structures of the C-terminal deletion mutant forms of MHV nsp6. The hydrophobic domains are presented as black rectangles, and the 
EGFP tag is presented as a dark gray rectangle when glycosylated and in light gray when unglycosylated. 


The transmembrane domain predictions for MHV nsp3 are 
somewhat different from those for SARS-CoV nsp3. Several 
programs predict the presence of five rather than three trans- 
membrane domains. Three of these correspond to those in 
SARS-CoV nsp3, but additional hydrophobic domains are lo- 
cated immediately up- and downstream of the first domain 
(Fig. 5D). However, not all programs predict these additional 
hydrophobic domains to be membrane spanning (28). Earlier 
work by Baker and coworkers, with a combination of in vitro 
translations, carbonate extraction assays, and proteinase K 
treatments, suggested that four of the five predicted hydropho- 
bic domains functioned as transmembrane domains, with only 
the fourth domain (i.e., the one corresponding to the second 
hydrophobic domain in SARS-CoV nsp3) not spanning the 
lipid bilayer (28). As these results are in conflict with the 
results we obtained for SARS-CoV nsp3, we prepared a similar 
set of progressive C-terminal deletion mutant forms of MHV 
nsp3 lacking one, two, three, or four of the putative transmem- 
brane domains and again fused to the EGFP®""" tag (Fig. 5D). 

The different constructs were expressed with the vIF7-3 
system in the presence or absence of tunicamycin. The EGFP 
tag was again not glycosylated when fused to the unmodified 
carboxy terminus of nsp3 or when the last hydrophobic domain 
(HDS) was deleted, consistent with the results obtained for 
SARS-CoV nsp3 (Fig. SE). However, when the fourth hydro- 
phobic domain was deleted as well, the fusion protein became 
partially glycosylated, as shown by the presence of a protein 
species the appearance of which was inhibited by tunicamycin 
(Fig. SE), indicating that the EGFP tag had become translo- 
cated. The same result, though with more efficient glycosyla- 


tion, was obtained when also the third hydrophobic domain 
was removed. Finally, additional deletion of the second hydro- 
phobic domain, which corresponds to the first hydrophobic 
domain of SARS-CoV nsp3, resulted in an unglycosylated pro- 
tein as its electrophoretic mobility was not affected by the 
presence of tunicamycin (Fig. 5E). As nsp3AHD4-5 and 
nsp3AHD3-5 were the only MHV nsp3 mutant constructs that 
became modified by N-linked sugars, while the other mutants 
showed no trace of glycosylation, we conclude that MHV nsp3 
integrates into the membrane similarly to SARS-CoV nsp3, 
with only two membrane-spanning domains at approximately 
the same positions in the protein (Fig. 5F). 

Membrane integration of nsp6. Transmembrane domain 
predictions for nsp6 yielded similar results for the MHV and 
SARS-CoV proteins, as well as for other coronaviruses shown 
in the multiple alignment of the orfla sequence. Because 
SARS-CoV nsp6 consistently appeared as a fuzzy band when 
analyzed by SDS-PAGE, which will complicate the interpreta- 
tion of our assay, we limited our focus to MHV nsp6. The 
observed cytoplasmic localization of both the amino and the 
carboxy termini of nsp6 implies the presence of an even num- 
ber of transmembrane domains, although invariably seven such 
domains are predicted. Thus, as for nsp3, one of the predicted 
transmembrane domains is probably not used as such. To con- 
firm this conclusion and identify this particular domain, we 
made progressive C-terminal deletion mutant constructs lack- 
ing one to six of the potential transmembrane domains and 
tagged once again with the EGFP®”* tag (Fig. 6A) and evalu- 
ated these mutant constructs as before. 

As shown in Fig. 6B, when only the first hydrophobic domain 
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was present (AHD2-7), the clear electrophoretic mobility dif- 
ference caused by tunicamycin indicated that the EGFP tag 
was glycosylated and thus present on the lumenal side of the 
membrane, implying that the first hydrophobic domain func- 
tions both as a signal sequence and as a transmembrane do- 
main. The mutant protein containing the first two hydrophobic 
domains (AHD3-7) remained unglycosylated, indicating that 
also the second hydrophobic domain spans the membrane. 
Mutant proteins with additional transmembrane domains con- 
tinued to reveal alternatingly the appearance and disappear- 
ance of N-linked modifications, indicating that the predicted 
transmembrane domains are integrated in the lipid bilayer. 
However, when reaching the sixth hydrophobic domain, the 
regular succession was interrupted as this protein, AHD7, ap- 
peared to become glycosylated, though only marginally. The 
wild-type protein showed no sign of glycosylation. These ob- 
servations lead to a model in which only six hydrophobic do- 
mains in MHV nsp6 are actually used as transmembrane do- 
mains. Our data appear to indicate that the sixth hydrophobic 
domain does not span the lipid bilayer; however, in view of the 
very inefficient glycosylation of the AHD7 mutant construct, we 
cannot fully exclude the possibility that the seventh hydropho- 
bic domain is not membrane spanning (Fig. 6C). 

The efficiency of glycosylation that we observed in this de- 
letion assay appeared to decrease with increasing protein 
length. We have no clear explanation for this. Perhaps the 
accessibility of the glycosylation site differs between the differ- 
ent proteins, though this does not seem very likely, assuming 
that the EGFP moiety folds independently and similarly in all 
cases its glycosylation site is always presented in the same way. 
Alternatively, the longer proteins might adopt multiple alter- 
native membrane topologies, though this is again not a very 
likely possibility since such behavior is not observed for 
AHD3-7, AHDS5-7, and full-length nsp6, as judged by their 
complete lack of glycosylation. It seems more plausible that the 
functioning of the third, and particularly the fifth, hydrophobic 
domain as an internal signal sequence for membrane insertion 
is somehow hampered by the EGFP extension, resulting in 
decreased efficiency of tag translocation. 

In view of these uncertainties and to obtain further evidence 
for our model, an additional set of mutant proteins was created 
in which each one of the MHV nsp6 hydrophobic domains was 
individually positioned between a C-terminal EGFP®”* tag and 
an N-terminal M,, tag containing an O glycosylation site (Fig. 
7A). The HD1 protein is, in fact, identical to nsp6AHD2-7 
(Fig. 6), except for the amino-terminal tag containing the O 
glycosylation site. The proteins were expressed with the vIF7-3 
system in the presence and absence of brefeldin A or tunica- 
mycin and analyzed as before (Fig. 7B). The fusion proteins 
containing hydrophobic domain 1, 2, 3, 4, or 5 appeared in two 
electrophoretic forms when expressed in the absence of any 
drug. The slower-migrating form of these was not observed 
when tunicamycin had been applied, indicating that these pro- 
teins became N glycosylated. The proteins that contained ei- 
ther hydrophobic domain 2, 4, 6, or 7 exhibited a partial shift 
in electrophoretic mobility upon expression in the presence of 
brefeldin A, indicating that these proteins were modified by 
O-linked sugars. The addition of O-linked sugars was less ef- 
ficient for the proteins containing either hydrophobic domain 
6 or 7. Strikingly, the proteins that contained hydrophobic 
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domain 2 or 4 were apparently able to adopt two alternative 
topologies, as both types of glycosylation were detected. These 
proteins are either N or O glycosylated, but not both, since in 
that case expression in the presence of brefeldin A would have 
resulted in an additional reduction in electrophoretic mobility 
relative to the N-glycosylated protein species. Thus, these pro- 
teins are still membrane associated. These results indicate that, 
in principle, all hydrophobic domains are able to individually 
mediate insertion into the membrane and to function as trans- 
membrane domains, although with different efficiencies, where 
especially domains 6 and 7 were inserted with lower efficiency 
(Fig. 7C). All together, the results are consistent with our 
model in which nsp6 has both of its termini exposed on the 
cytoplasmic face of the membrane while spanning the lipid 
bilayer six times. Of the seven predicted transmembrane do- 
mains, the sixth or seventh is least likely to function as such. 

Membrane integration of nsp4. In view of the observations 
with nsp3 and nsp6, we wanted to complete these evaluations by 
similarly analyzing the membrane structure of the third viral 
membrane protein involved in anchoring the replication complex, 
nsp4. Our earlier study already revealed that both termini of this 
protein are oriented cytoplasmically and that the protein, both for 
SARS-CoV and for MHYV, is N glycosylated in a region between 
the first two of its four hydrophobic domains, which led us to 
propose a tetraspanning structure (37). Yet, we could not exclude 
the possibility that actually two of the three carboxy-terminal 
hydrophobic domains are not membrane spanning. Hence, we 
constructed progressive C-terminal deletion mutant constructs of 
both MHV and SARS-CoV nsp4 in which the natural glycosyla- 
tion sites had additionally been removed by mutagenesis and 
which were again fused to the EGFP®" tag (Fig. 8A and D). The 
proteins were expressed in the presence and absence of tunica- 
mycin and analyzed as before. 

The nsp4 mutant forms of both viruses gave similar results 
(Fig. 8B and E). Full-length nsp4-EGFP®"* remained unglyco- 
sylated, as expected (37). When only the first hydrophobic 
domain was present (AHD2-4), the fusion proteins became 
efficiently N glycosylated, as shown by the complete shift in 
electrophoretic mobility due to the presence of tunicamycin. 
This is again as expected and confirms that the first hydropho- 
bic domain spans the lipid bilayer. When the first two hydro- 
phobic domains were present (AHD3-4), the electrophoretic 
mobility was not influenced by the presence of tunicamycin. 
Since this fusion protein was not N glycosylated, its C terminus 
was located on the cytoplasmic side of the membrane, implying 
that also the second hydrophobic domain spans the lipid bi- 
layer. N glycosylation was again observed, though especially for 
MHV nsp4 not very efficiently, when only the fourth hydro- 
phobic domain was deleted (AHD4), as demonstrated by the 
higher-molecular-weight band seen in the absence of tunica- 
mycin but not in its presence, indicating that also the third and 
fourth hydrophobic domains span the membrane. For several 
nsp4 deletion mutant constructs, two rather than one tunica- 
mycin-resistant protein species could be detected, which is 
explained by the partial removal of the first transmembrane 
domain by signal peptidase (37; data not shown). It appeared 
that signal peptidase cleavage of nsp4 was more efficient for 
the smaller deletion mutant proteins. N glycosylation of the 
SARS-CoV nsp4AHD4 mutant form combined with signal 
peptidase cleavage resulted in an electrophoretic mobility sim- 
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FIG. 7. Membrane integration of MHV nsp6 hydrophobic domains. (A) Schematic representation of the MHV nsp6 constructs. (B) VITF7-3- 
infected OST7-1 cells were transfected with the constructs presented in panel A and expressed in the presence (+) or absence (—) of tunicamycin 
(TM) or brefeldin A (brefA). The cells were labeled with [*°S]methionine from 5 to 6 h p.i., lysed, and processed for immunoprecipitation with 
anti-EGFP antiserum, followed by SDS-PAGE. The positions and masses (in kilodaltons) of the protein size markers are indicated at the left. Only 
the relevant portion of the gel is shown. The asterisks indicate the position of the N-glycosylated protein species, and the arrowhead indicates that 
of the O-glycosylated protein species. Below the panels, the observed presence or absence of N- and or O-linked oligosaccharides is indicated. 
(C) Model of the membrane topologies of the nsp6 hydrophobic domains. The hydrophobic domains are presented as black rectangles, the 
EGFP®”* tag is presented as a gray rectangle, and the My tag is presented as a gray circle. Dark gray rectangles or circles represent glycosylated 
tags, and light gray rectangles or circles represent unglycosylated tags. The dotted line connects the transmembrane domains in accordance with 
the full-length nsp6 structure, indicating that either the sixth or the seventh hydrophobic domain does not traverse the lipid bilayer. 


ilar to that of the unglycosylated and uncleaved protein (hence 
the two asterisks in Fig. 8E, which indicate the position of the 
N-glycosylated protein species), similar to what has been ob- 
served previously (37). The results show that, in contrast to 
nsp3 and nsp6, all of the predicted transmembrane domains of 
nsp4 span the lipid bilayer (Fig. 8C and F). 


DISCUSSION 


Coronaviruses have exceptionally large RNA genomes and 
exhibit complex replication and transcription strategies. These 
processes take place at DMVs located in the perinuclear re- 
gion of the host cell. Although the membranes of these struc- 
tures are most likely derived from the ER, the way they are 
organized and modified to function as the scaffolds of the 


replication complexes is still largely unknown. Presumably, as 
has been demonstrated for other RNA viruses, the hydropho- 
bic nsps are key organizers of the observed membrane rear- 
rangements. All known members of the families Coronaviridae 
and Arteriviridae encode three nsps containing hydrophobic 
domains. Invariably, hydrophobic domains are present in the 
two nsps surrounding the 3C-like main proteinase encoded by 
nsp5 of coronaviruses and by nsp4 of arteriviruses, while ad- 
ditional hydrophobic domains are found in the large nsp im- 
mediately upstream of this cluster, which further contains the 
papain-like proteinase domains (19). For EAV, the coexpres- 
sion of nsp2 and nsp3, which are the homologues of coronavi- 
rus nsp3 and nsp4, is sufficient to induce the formation of 
DMVs (49). Similar results, however, have so far not been 
reported for coronaviruses. In order to get more insight into 
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FIG. 8. Membrane integration of deletion mutant forms of MHV and SARS-CoV nsp4. (A, D) Schematic representations of the MHV (A) and 
SARS-CoV (D) nsp4 C-terminal deletion mutant forms, with the hydrophobic domains presented as black rectangles and the EGFP®"* tag shown 
in gray. (B, E) vIF7-3-infected OST7-1 cells were transfected with the indicated constructs and expressed in the presence (+) or absence (—) of 
tunicamycin (TM). The cells were labeled with [*°S]methionine from 5 to 6 h p.i., lysed, and processed for immunoprecipitation with anti-EGFP 
antiserum, followed by SDS-PAGE. The positions and masses (in kilodaltons) of the protein size markers are indicated at the left. Only the relevant 
portions of the gels are shown. The asterisks indicate the position of the glycosylated protein species. Below the panels, the observed presence or 
absence of N glycosylation is indicated by a plus or a minus sign, respectively (C, F). Models of the membrane structure of nsp4 of MHV (C) and 
SARS-CoV (F), with the hydrophobic domains presented as black rectangles. 


the function of the coronavirus hydrophobic nsps, we studied 
the topology and membrane integration of these proteins. The 
results demonstrate that, similar to nsp4 (37), both nsp3 and 
nsp6 have a Nendo/Cendo topology with an even number of 
transmembrane domains, although for both proteins an un- 
even number of transmembrane domains was predicted. The 
conservation of hydrophobic domains not actually serving as 
transmembrane domains suggests that these domains may play 
important roles in the formation, stabilization, or functioning 
of the replication complexes. 

Membrane structures of complex multispanning proteins are 
notoriously difficult to establish. Ideally, one would like to 
probe the disposition of particular domains in such proteins by 
introducing or removing glycosylation sites and by using spe- 
cific antisera raised to peptides corresponding to these do- 
mains. Removal of the glycosylation sites in the large lumenal 
loops of nsp3 and nsp4 was relatively straightforward. Unfor- 
tunately, however, no glycosylation motifs occurred in the 
other, much smaller loops, and motifs introduced into such 
loops appeared to be nonfunctional (data not shown), probably 
because they are not accessible to the enzymes. As domain- 
specific antibodies are not available—and are probably hard to 
generate due to the small size of most loops—we decided to 
use other frequently applied approaches, which involved tag- 


ging of the nsps with epitopes or EGFP carrying O or N 
glycosylation sites in combination with progressive deletions of 
hydrophobic domains. A potential drawback of these tech- 
niques is that the manipulation may affect the conformational 
behavior of the protein. Thus, deletion mutant proteins often 
appeared to be only partially glycosylated. While this might be 
taken as an indication that the protein can adopt more than 
one membrane topology, we do not consider this interpreta- 
tion very likely. First of all, a large number of our mutant 
proteins did not become glycosylated at all and apparently do 
not adopt a dual topology. Furthermore, glycosylation was 
invariably observed for those deletion mutant proteins in which 
the glycosylatable tag replaced a lumenal loop with an authen- 
tic glycosylation site. It seems more likely that the partial 
glycosylation of the proteins is inherent to the type of approach 
and relates to their very hydrophobic nature and their often 
drastically modified state, which makes them prone to aggre- 
gation when expressed out of their natural context. Further- 
more, we confirmed our results by performing the experiments 
with nsps of both SARS-CoV and MHYV, which yielded essen- 
tially identical results. 

We studied the topology and membrane integration of 
SARS-CoV and MHV hydrophobic nsps by expressing the 
proteins independently, i.e., out of their polyprotein context. 
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FIG. 9. Model of the membrane disposition of coronavirus ppla. The black line represents the protein, with the hydrophobic domains depicted as 
gray rectangles, and shows its localization with respect to the membrane. The proteases, papain-like protease (PLpro) and main protease (Mpro), and 
their cleavage sites (arrowheads) are indicated, with the PLpro cleavage sites in gray and the Mpro cleavage sites in black. The confirmed N glycosylation 
sites of the proteins are indicated by asterisks. In this model, the sixth hydrophobic domain of nsp6 does not span the lipid bilayer; however, we cannot 
exclude the possibility that the seventh rather than the sixth predicted transmembrane domain is not used as such. 


Similar strategies have been employed before to study the 
membrane integration of viral nsps derived from large precur- 
sor proteins (33, 34, 37). Unlike for nsp4 and nsp6, we ex- 
pressed only the C-terminal approximately 700 amino acids of 
nsp3, the part that contains all of the putative transmembrane 
domains. The full-size, approximately 2,000-amino-acid 
polypeptide would have been too large to detect the often 
subtle size differences inherent to our analytic approach. 
Moreover, it appeared difficult to express complete nsp3, pre- 
sumably due to the presence of toxic sequences, as have also 
been reported by others to occur in approximately the same 
region of other coronavirus genomes (1, 60, 61). The N-termi- 
nal region of nsp3 lacking in our expression constructs does not 
contain any appreciable hydrophobic domains, and prediction 
programs do not identify any transmembrane domains in this 
region. Hence, we believe that the models obtained for the 
partial proteins apply as well to complete MHV and SARS- 
CoV nsp3. 

By using biochemical and immunofluorescence assays, we 
demonstrated that in both SARS-CoV and MHYV, nsp3 has a 
Nendo/Cendo topology. While this is inconsistent with the 
predicted number of transmembrane domains (Fig. 1), it is in 
agreement with a previous study on the topology of MHV nsp3 
(28). A Nendo/Cendo topology obviously makes more sense 
than the predicted Nexo/Cendo orientation with respect to the 
localization of the PLpro domain and its substrates (Fig. 9). 
However, whereas previously four transmembrane domains 
were identified for MHV nsp3 (28), we found evidence for the 
presence of only two such domains in both of the nsps studied. 
We could not confirm the presence of additional membrane- 
spanning domains located immediately up- and downstream of 
the first transmembrane domain, nor were these additional 
domains predicted on the basis of the multiple alignment. Our 
study did confirm, however, the use of the previously identified 
N-linked glycosylation sites between the first and second trans- 
membrane domains (22, 28). It further demonstrated that in 
both SARS-CoV and MHYV, the second, but not the third 
predicted transmembrane domain of nsp3 is used. This result is 
in conflict with findings of Baker and coworkers, who identified 
the third rather than the second predicted transmembrane 
domain of MHV nsp3 as spanning the lipid bilayer (28). Their 
conclusion was mainly based on electrophoretic mobility mea- 
surements of hydrophobic nsp3 fragments obtained after pro- 
teinase K digestions. Hydrophobic proteins, including corona- 
virus nsp4 and nsp6, often exhibit anomalous migration in gels, 
and this might explain the different interpretations. Regardless 
of the number of transmembrane domains, it is striking that 


the first transmembrane domain in coronavirus ppla and 
pplab is found only after more than 2,000 amino acids, while 
the polyproteins and mature nsp3 lack an identifiable signal 
sequence at the amino-terminal end. Apparently, the first hy- 
drophobic domain of nsp3 is able to function as an internal 
signal sequence for membrane insertion, as we confirmed in 
our study. 

Like nsp4 and nsp3, nsp6 of SARS-CoV and MHV appeared 
to have a Nendo/Cendo membrane topology. This is again 
inconsistent with the prediction, which identified seven trans- 
membrane domains, compatible with a Nendo/Cexo topology. 
The experimentally established topology obviously makes 
more sense, as it positions all of the main proteinase cleavage 
sites on the same side of the membrane as the proteinase itself 
(Fig. 9). Demonstrating which of the hydrophobic domains in 
nsp6 actually served as transmembrane domains was not as 
straightforward as it was for nsp3. It appeared that, in princi- 
ple, all seven domains have the capacity to function as signal 
sequences and membrane anchors, irrespective of their fate in 
the full-length protein. The combined observations appear to 
fit best a model in which nsp6 contains six transmembrane 
domains, with the sixth or seventh hydrophobic domain not 
spanning the lipid bilayer. The transmembrane prediction pro- 
gram SOSUI (http://bp.nuap.nagoya-u.ac.jp/sosui/sosuimenu0 
-html) also predicts seven transmembrane-spanning helices but 
additionally calculates that the hydrophobicity of the sixth do- 
main, but not that of the others, is similar to the hydrophobic- 
ity of helices found in soluble proteins (23). This prediction 
hence supports the model in which the sixth putative trans- 
membrane domain does not span the lipid bilayer. 

Our experience with nsp3 and nsp6 prompted us to revisit 
also the membrane structure of nsp4. With four hydrophobic 
domains, the known Nendo/Cendo topology of this protein 
could, in principle, be achieved with either two or all of those 
as transmembrane domains. Our deletion assay led us to con- 
clude that all four domains are membrane spanning. Remark- 
ably, in MHV, the removal of the fourth domain was found by 
Sparks and coworkers (50) to yield a virus that grew in cultured 
cells with kinetics and to titers similar to those of the wild-type 
virus, although the kinetics of its RNA replication were signif- 
icantly delayed. Given the topological consequences of this 
deletion, one has to assume that in this mutant virus either the 
third hydrophobic membrane is not functioning as a trans- 
membrane domain or, as the authors suggest, the effect of the 
deletion is compensated by the region between the third and 
fourth transmembrane domains, which is also somewhat hy- 
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drophobic. Mutant viruses lacking any of the other three hy- 
drophobic domains of nsp4 could not be recovered. 

Combining all of the information obtained for nsp3, -4, and 
-6 into one model, we arrive at the membrane structure of the 
ppla precursor that is depicted in Fig. 9. As a result of the 
Nendo/Cendo topology of all three membrane-anchored nsps, 
the major part of the polyproteins, including all known func- 
tional domains, is located on the cytoplasmic side of the 
membrane. Of the entire approximately 4,400-amino-acid 
ppla precursor, only six relatively short stretches are exposed 
lumenally, exceptions being the glycosylated polypeptides oc- 
curring between the first and second transmembrane domains 
of nsp3 and nsp4, which comprise some 80 and 250 residues, 
respectively. Interestingly, both of these polypeptide loops con- 
tain several highly conserved cysteine residues. Mutations in 
the lumenally exposed part of MHV nsp4 (and its EAV coun- 
terpart) were shown to affect DMV formation (8, 39). 

Also, the pp1b precursor, which lacks membrane-spanning 
domains but carries cleavage sites processed by the nsp5-en- 
coded main protease, and its cleavage products nsp12 to -16 
end up cytoplasmically. This topological arrangement fits into 
a picture of the biogenesis of the replication complexes in 
which membrane proteins nsp3, -4 and -6, while still contained 
in the precursor, direct the formation of the membranous 
structures on which RNA replication is to take place. These 
proteins probably shape the platform onto which the subse- 
quently appearing nonanchored cleavage products derived 
from ppla and pplb remain associated and to which cellular, 
as well as viral (e.g., N protein), components may additionally 
attach to generate the functional replication complex. 

What role(s) the conserved non-membrane-spanning hydro- 
phobic domains in nsp3 and -6 play in the formation and func- 
tioning of the complex remains to be established. A cytoplasmi- 
cally exposed hydrophobic domain might serve as an interaction 
partner for other components of the complex, while the hydro- 
phobic non-membrane-spanning domains might also contribute 
to the double-membrane configuration, for instance, by dipping 
into the hydrophobic phase of the membrane. The latter has been 
observed earlier for proteins containing amphipathic helices, 
showing that these domains can act as a wedge in the membrane 
and thereby induce curvature (32, 65). 

The induction of membrane rearrangements to create sites 
for RNA replication and transcription in infected cells is not 
unique to coronaviruses but is common among plus strand 
RNA viruses. These viruses generally encode nsps that induce 
membrane rearrangements. These nsps can be integral mem- 
brane proteins, but this is not always the case. The occurrence 
of hydrophobic/amphipathic regions that do not span the bi- 
layer but are peripherally associated with membranes is not an 
exception; it rather is a common theme. Interestingly, dengue 
virus, which also replicates on DMVs (56), encodes two non- 
structural membrane proteins that, in addition to transmem- 
brane domains, also contain hydrophobic domains that do not 
span the lipid bilayer (33, 34). The NS4A and NS4B proteins of 
this virus contain one and two hydrophobic domains, respec- 
tively, that do not function as transmembrane domains in the 
full-length protein. Yet, the non-membrane-spanning hydro- 
phobic domain of NS4A is integrated into the membrane when 
expressed individually (33, 34), comparable to our observations 
with the hydrophobic domains of MHV nsp6. In the case of 
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hepatitis C virus, which, like dengue virus, also belongs to the 
flavivirus family, the amphipathic helix contained in NS4B was 
shown to be required for the formation of replication sites 
(15). Likewise, the poliovirus 2C protein, which is able to 
induce the membrane rearrangements associated with RNA 
replication, also contains an essential amphipathic domain (14, 
38, 54). In view of these observations, it is tempting to specu- 
late that these similarities among the viral nonstructural mem- 
brane-associated proteins relate to their common ability to 
induce membrane rearrangements. 
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